A GMM supervector approach for spoken Indian language identification for mismatch utterance length
نویسندگان
چکیده
Gaussian mixture model-universal background model (GMM UBM) supervectors are used to identify spoken Indian languages. The calculated from short-time MFCC, its first and sec derivatives. UBM builds a generalized language model, mean adaptation transforms it duration normalized language-specific GMM. Multi-class support vector machine artificial neural network classifiers labels the supervectors. Experimental evaluations performed using 30 speech utterances nine languages comprised five Indo-Aryan four Dravidian languages, extracted all India radio broadcast news data-set. Eight smaller data-sets were manually derived study effect of training test mismatch. In mismatch conditions, identification accuracy decreases with decrease in train utterance duration. Investigations showed that 32-mixture ANN classifier has optimal performance.
منابع مشابه
Phonotactic Model for Spoken Language Identification in Indian Language Perspective
Indian Languages are Indo-Aryan being influenced by Sanskrit or Dravidian being influenced by Tamil. Dravidian Languages have the influence of Sanskrit also. All Indian Languages have the influence of Pali language for which the graphemes are being influenced Brahmi. All the Indian languages are phonetic in nature. Every Indian language has its distinctive phone sets. North Indian languages are...
متن کاملa new approach to credibility premium for zero-inflated poisson models for panel data
هدف اصلی از این تحقیق به دست آوردن و مقایسه حق بیمه باورمندی در مدل های شمارشی گزارش نشده برای داده های طولی می باشد. در این تحقیق حق بیمه های پبش گویی بر اساس توابع ضرر مربع خطا و نمایی محاسبه شده و با هم مقایسه می شود. تمایل به گرفتن پاداش و جایزه یکی از دلایل مهم برای گزارش ندادن تصادفات می باشد و افراد برای استفاده از تخفیف اغلب از گزارش تصادفات با هزینه پائین خودداری می کنند، در این تحقیق ...
15 صفحه اولGmm Supervector for Content Based Music Similarity
Timbral modeling is fundamental in content based music similarity systems. It is usually achieved by modeling the short term features by a Gaussian Model (GM) or Gaussian Mixture Models (GMM). In this article we propose to achieve this goal by using the GMM-supervector approach. This method allows to represent complex statistical models by an Euclidean vector. Experiments performed for the musi...
متن کاملMUESLI: multiple utterance error correction for a spoken language interface
We propose a method for using all available information to help correct recognition errors in tasks that use constrained grammars of the kind used in the domain of Command and Control (CC) systems. In current spoken language CC systems, if there is a recognition error, the user repeats the same phrase multiple times until a correct recognition is achieved. This interaction can be frustrating fo...
متن کاملNew Features for Language Identification Using Gmm
Automatic Language Identification (LID) is the process of identifying the language spoken within an utterance. The challenge that this task presents is that no prior information is available indicating the content of the utterance or the identity of the language. Most of the existing LID systems are based on MFCC feature vectors. This paper introduces the use of new feature extraction approach ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Bulletin of Electrical Engineering and Informatics
سال: 2021
ISSN: ['2302-9285']
DOI: https://doi.org/10.11591/eei.v10i2.2861